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Abstract We analyze the occurrence and the values 
of record-breaking temperatures in daily and monthly 
temperature observations. Our aim is to better under- 
stand and quantify the statistics of temperature records 
in the context of global warming. Similar to earlier work 
we employ a simple mathematical model of independent 
and identically distributed random variables with a lin- 
early growing expectation value. This model proved to 
be useful in predicting the increase (decrease) in upper 
(lower) temperature records in a warming climate. Us- 
ing both station and re-analysis data from Europe and 
the United States we further investigate the statistics 
of temperature records and the validity of this model. 
The most important new contribution in this article 
is an analysis of the statistics of record values for our 
simple model and European reanalysis data. We esti- 
mate how much the mean values and the distributions 
of record temperatures are affected by the large scale 
warming trend. In this context we consider both the 
values of records that occur at a certain time and the 
values of records that have a certain record number in 
the series of record events. We compare the observa- 
tional data both to simple analytical computations and 
numerical simulations. We find that it is more difficult 
to describe the values of record breaking temperatures 
within the framework of our linear drift model. Obser- 
vations from the summer months fit well into the model 
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with Gaussian random variables under the observed lin- 
ear warming, in the sense that record breaking temper- 
atures are more extreme in the summer. In winter how- 
ever a significant asymmetry of the daily temperature 
distribution hides the effect of the slow warming trends. 
Therefore very extreme cold records are still possible in 
winter. This effect is even more pronounced if one con- 
siders only data from subpolar regions. 

Keywords Records • Record temperatures • Extreme 
value statistics • Extreme weather • Climate Change • 
Stochastic processes 
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1 Introduction 

In the context of global warming, record-breaking tem- 
peratures have received considerable attention recently. 
In newspapers and in television one frequently hears of 
hottest summer days, extreme heat streaks, or record 
breaking storms. Extreme and record breaking weather 
events are not only interesting for the observer but can 
also have a big impact on agriculture, economy and 
human life. If one considers record-breaking events in 
climatology, these should of course always be seen in 



2 



G. Wergen et al. 



25 



20 



15 



10 



Measurements 
Upper record value 
Lower record value 



r 



* . • . • • • 

'.•«•• I' \ 



1800 



1850 



1900 
Year 



1950 



2000 



Fig. 1 Daily maximum temperature measurements for 
November 1st in Prag for the time span from 1772 to 2010. 
The figure illustrates the progression of the upper and lower 
record values. The full red line gives the progression of the 
highest temperature in the time-series and the dotted blue 
line the progression of the lowest one. In each case the num- 
ber of steps is the respective upper or lower record number. 
The statistics of those record progressions, the number and 
the height of the steps is the subject of this paper. 



the context of global warming. The crucial question is: 
How much are the climate records we e ncounter today 
affect ed by the evident climatic change (jSolomon et all . 
20071) of the last decades? 

While the study of extremal events in ge neral is very 
impor tant i n climatology (Easterling et al, Il997t Stott 
et al 



20041: Brown et al, l2008t Cat tiaux et al, 12009 : 



Meehl et al, l2008t Min et al. l2009h . record tempera- 
tures have not received much attention until recently. 
Even though it is intuitively clear that increasing tem- 
peratures should result in a higher than average number 
of hot day records, this effect was not studied and de- 
tected in observational data for a long time. However, 
there has been some research on the statistics of tem- 
perature records in the last years. Redner and Peterson 
(200G) analyzed the statistics of record temperatures 
for daily temperature measurements from Philadelphia 
over a 126 year time-span. Due to the fact that they 
only used data from a single station, their analysis did 
not establish a significant connection between the in- 
crease of global mean temperatures and local record- 
breaking events, but made important theoretical contri- 
butions to the subje ct. There is also some earlier work 
by Benestad ([2003! ) . who analyzed the occurrence of 
records for stations in Scandinavia. 
Meehl et al EoQ£ 



found a significant effect of slowly 
changing temperatures on the occurrence of records for 
daily temperatures at weather stations in the United 
states. In particular they demonstrated that in an anal- 
ysis starting from 1970 the rate at which upper records 



occurred declined more slowly than the rate of lower 
records. In 2010 two of us obtained sim i lar re sults in an 
independent study ( Wergen and Kruel 2010h . We per- 
formed an extensive analysis of European and American 
station data and combined it w ith a simple mathemat- 
ical model ( Franke et al . 2010[) to quantify the effect 
of slow temperature changes upon the records of daily 
temperatures. For the European station data covering 
the time period 1976-2005, wc found that on average 
5 of the 17 high temperature records recorded at one 
station in one year can be attributed to the observed 
slow increase of temperature. 

Our findings and our analytical mod el we re dis- 
cussed and confirmed by Elguindi et al <|2012f) using 
gridded data from regional climate models. They also 
made predictions for the spatial distribution of record 
temperatures in Europe for the future based on model 
data from the ENSEMBLES project. Newman et al 
(2010) analyzed record breaking temperatures at a very 
high resolution from the Mauna Loa Observatory on 
Hawaii for the time-span from 1977 to 2006. They also 
presented evidence for slowly increasing temperatures 
in the occurrence of records. However, in their data 
they found that while the rate of cold records is signif- 
icantly decreased, the number of hot record s remained 
unchanged. Rahmstorf and Coumou (|2011l ) considered 
monthly mean temperatures from a weather station in 
Moscow and could show that the number of hot records 
in these mean values increased significantly. They also 
discussed the effect of climatic change on the occur- 
rence of global-mean temperature records based to a 
large extent on numerical experiments. 

The purpose of this paper is to provide a detailed 
analysis of the statistical properties of record-breaking 
temperatures also from a theoretical point of view. The 
main idea behind this analysis is illustrated in Fig. [T] 
We consider the progression of the records and record 
values in time series of temperature measurements for 
individual calendar days and months. This way, the 
daily measurement are always one year apart from each 
other and, to a good approximation, their statistics can 
therefore be compared to uncorrelated random vari- 
ables. We use both station and gridded re-analysis data 
of daily and monthly mean and maximum temperatures 
to get a more complete picture in space and time. 

In particular we will consider monthly mean tem- 
peratures at single stations for the continental United 
States and gridded temp erature data for Europe. In 
(jWergen and KrusJ . 120101 ) we already considered daily 
station data from the United States and had difficulties 
to quantify the effect of slow changes in temperature 
with our model. In this paper we will find that if one 
analyses monthly mean values both increase of the up- 
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per record number and the decrease of the lower record 
number is much more pronounced than in the daily 
data. The reason for this is the strongly reduced level 
of variability in the monthly averages. For the European 
daily data we already found a strong effect of slowly in- 
creasing temperatures since the increase of the mean 
temperature was stronger and the standard deviation 
was smaller in this data set. Therefore and because of 
the high densit y and homogeneity of the gridded tem- 
perature data (jHavlock et a 1, 2008) we decided to an- 
alyze the statistics of record values of these European 
data. 

At this point we will give a brief outline of the arti- 
cle: Before we introduce and discuss the different data 
sets, we will give an overview and some new results 
on the record statistics of independent and identically 
distributed (iid) random variables (RV's) with a lin- 
ear drift. Th is Line ar Drift Model (LDM) discussed by 
Franke et a 1 (2010) is very important for our under- 
standing of temperature records. The results for the 
occurrence of records in the LDM have been published 
before, but we briefly describe them to make the article 
self contained. 

In section 3 we introduce the data sets that were 
mentioned above. We describe the statistical properties 
of the measurements and their time-dependence and 
discuss how well the observational data fit the LDM. 
For that purpose we analyzed the time-dependence of 
the mean and the standard deviation of the recordings. 
In the following section 4 we will then first consider 
the occurrence of records in the different data sets, in 
particular the record rate in the daily and monthly tem- 
perature recordings. We analyzed the record rate in the 
United States with respect to the different seasons to 
find out when the effect of slowly evolving tempera- 
tures on the record statistics is strongest. In particular 
we will discuss the ratio between the number of upper 
records and the number of lower records and compare 
it to the predictions from our analytical model. 

Section 5 is again about theoretical aspects. Here 
we will discuss the statistics of record values within 
the LDM using both an analytical approach and nu- 
merical simulations. We quantify the effect of the lin- 
ear drift on the mean value and the standard deviation 
of record values that occur at a certain time step and 
of record values that have a certain record number in 
the series of record events. The aim is to understand 
if the record events we have to expect in the presence 
of slowly increasing temperatures are more extreme or 
more variable than without any climatic change. 

In the subsequent section 6 we will then compare 
the findings of section 5 to the observational data we 
already introduced and discussed in sections 3 and 4. 



In section 6 we will consider the behaviour of the mean 
values of record events as well as their full distributions. 
For that purpose we will introduce a simple rescaling 
of the observational data to account for seasonal and 
spatial variations in the standard deviation of the time 
series. Then we will discuss both the values of records 
that occur in a given year as well as the values of records 
that have a certain record number. 

Finally in section 7 we summarize and evaluate our 
findings and discuss them in the context of ongoing re- 
search in the field of temperature records. 



2 Theory: Record occurrence in the presence of 
linear drift 



2.1 Record statistics of iid. RV's 

We consider time series of uncorrelated random vari- 
ables (RV) Xk from continuous probability densities 
fx k (%k) , k G {1, 2 . . . , n}. As mentioned earlier, an up- 
per (lower) record in the nth step is an entry X n that is 
larger (smaller) than all previous entries Xk with k < n. 
The basic properties of recor d events in such time series 
can for instance be fo und in ( Arnold et al . 19981 Glick . 
1978tlNevzorovll2000h . In the special case of identically 
and independently distributed (iid) RV's from a sin- 
gle probability density fx {x) the probability that the 
nth entry is a record is simply Riven by P n — 1/n (cf. 
(jArnold et at Il998i: iNevzorovl boool) ). This holds, be- 
cause of the symmetry of the problem, both for upper 
and lower records. From now on we will call the proba- 
bility P n that the nth entry in the series is a record the 
record rate. In the iid case the mean number of records 
R n up to the nth step can be obtained by summing 
over the record rate and for large n we find: 



fe=i 



Pk 



^ k 

k=l 



In (n) + 7. 



(1) 



Here, 7 sa 0.577215 . .. is th e Eule r-Mascheroni constant 
( Arnold et al 1998 ; Glick , 1978 ) . In this case, one can 
prove that record events are uncorrelated ( Wergen et all . 
201 lh and, if one goes to a logarithmic time scale one 
finds th at the y form a Poisson process (Sibani and Lit- 
tlewood ll993l) . Another important feature of this result 
for records of iid RV is that P n and R n are completely 
independent of the shape of the underlying probability 
density fx (x). 



2.2 Linear Drift 

In general, when the RV's Xk are not identically dis- 
tributed, it is more difficult to compute the record rate 
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P n . For independent RV's X k from a series of arbitrary 
continuous distributions fx k (xk ) t he upper reco r d rate 
is given by the following integral (lArnold et a | Il998l: 
GrickLll978h : 



/n— 1 
dx n f Xn (x n ) J] F Xk (x n ) (2) 
fc=l 

where Fx k (x n ) is the probability distribution function 
of f Xk (x k ) with F Xk (x n ) = J Xn dx f Xk (x). Here, the 
integrand is just the probability that the nth entry has 
a value of x n times the probability that all previous en- 
tries are smaller, which is represented by the product. 
This probability is then integrated over all possible val- 
ues for a record in the nth step x n . Analogous to this 
the lower record rate P* is given by: 



(3) 



/n—l 
dx n f Xn (x n ) JJ (1 - Fx k (x n )) . 
fe=i 

The LDM w as first considered by Balleri ni and Resnick 
( 1985Hl987t ) and later also by Borovkov |l999l) . In this 
model we consider iid RV's Y k with a linear drift of the 
following form: 



X k = Y k 



ck, 



(4) 



with a constant c. In this case fx k (x) is simply given 
by fx k {x) = f{x — ck) with fixed / (x). The underly- 
ing distributions have all the same shape, but the mean 
value increases with a constant speed c. This model was 
used before to better understand the statistics of ath- 



letic records (Ballerini and Rcsnick.119851: Gcmbris et al, 
20071 ). but we showed that is also capable of describing 
the occurrence of recor ds in d aily temperature record- 
ings (Wergen and Krug, 20101 Rahmstorf and Coumou, 
201 ll ). By considering ^ one finds that for any con- 
stant drift c the record rate in the LDM is of the fol- 
lowing form: 



Pn{c) 



/OO 
dx f(x)J[F(x + ck) 
k=i 



(5) 



Most interesting for us is the statistics of records for 
a drift velocity c much smaller than the width of the 
probability distribution / (a;). In most cases this width 
is just the standard deviation a of the probability dis- 
tribution. Performing a series expansion of ([5]) in the 
regime of c <C n/a one finds the followi ng approxima- 
tion for P n (c) ( Wergen and Kruel . 2010l ): 



P 



1 c n(n - 
(c) "n + ^ 



1) 



dy f (y)F n ~ 2 (y). 



(0) 



Franke et all ( 2010l) evaluated this expression for distri- 
butions from all thr ee cla sses of extreme value statis- 
tics (Gala mbos et al, 1994]; De H aan and Ferreira, 2006 : 
Nevzorov,|2003 G umbel. Il954l) . The dependence of the 
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Fig. 2 Annual mean temperature (top) and standard devia- 
tion (bottom) of monthly mean temperatures from contiguous 
US stations 1960-2010. Dots are one year averages, full lines 
give a five year running mean. The anomalies defining the 
standard deviations of the mean temperature were computed 
by subtracting the mean and linear trend 1960-2010 individ- 
ually for each month. Annual mean values increase over the 
last 50 years with an average rate of 0.018 K°/y, whereas the 
standard deviation of the monthly anomalies remained more 
or less constant. 



record rate P n (c) on the drift c is systematically differ- 
ent between the three classes, but also within them one 
can find differences between different individual prob- 
ability distributions. For the Weibull class of probabil- 
ity distributions with a finite support the effect of the 
linear drift on P n (c) was found to be strong and in- 
creasing with n. For the distributions of the Frechet 
class with power-law tails it decays with growing n and 
vanishes for n — > 00 ( Franke et all . 2010h . The behavior 
of P n (c) in the Gumbel class of distributions with an 
infinite support that decay faster than a power law is 
intermediate between these two cases. For a simple ex- 
ponential distribution with / (x) = l/ve~ x / v the effect 
of the drift on the record rate is independent of n and 
we have P n (c) w - + cvjla. 
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Most interesting for our applications is P n (c) for a 
Gaussian distribution of the following form: 

1 



/(*) 



27R7 



(7) 



Here, fi is the mean value of the probability distribution 
and a its standard deviation. The approximate evalua- 
tion of Eq. © for large enough n with c< n/d and a 
Gaussian distribution with standard deviation a yields: 



1 c 2^/rF I fn 2 

p n (c )«- + -- Win hr 



(8) 



Franke et al d2010l) compared this approximation to nu- 
merical simulations and good agreement was found for 
n > 7 and c <C n/cr. The result was applied to the 
statistic s of record-breaking tem peratures for the first 
time by Wergen and Krug ( 2010l ). who showed that the 
regime of cj a <C 1 is appropriate for the modeling of 
daily temperatures in a climate with moderate warm- 
ing like that occurring in the last decades. This result 
will also be employed for comparison with observational 
data in section 4 of this article. 

Another complication that can arise in the context 
of record statistics of historical temperature recordings, 
is the problem of rounding. The fact that, for tech- 
nical reasons, temperature measurements can only be 
recorded up to a certain degree of accuracy opens up 
the possibility for ties. Usually the observations are 
rounded down (or up) to a certain value kd, where d 
is a discretization length (e.g. 0.1 K° if temperatures 
are stored up to the firs t digit) and k an integer num- 
ber. Wergen et al ( 20121 ) recently explored the interest- 
ing and manifold consequences of this rounding for the 
statistics of records in time series of iid RV's in the con- 
text of the three universality classes of extreme value 
statistics. 



3 General introduction of the data 

We focus our analysis on two different sets of data from 
Europe and the contiguous United States. The purpose 
of this section is to analyze the distributional properties 
of these data sets. We want to know if the LDM dis- 
cussed in the previous section is applicable to the obser- 
vational data. In particular we will examine if a Normal 
or Gaussian probability density function with a slowly 
changing expectation value is a reasonable approxima- 
tion. Therefore we are interested in the linear trend c 
of the daily and monthly temperature averages and the 
standard deviation a around that transient mean value 
fi + ck. As outlined above the most important quantity 
in our analysis is the normalized drift cj a. 
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Jun, Jul, Aug - c = +0.004 K/y 
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Fig. 3 Time series of seasonal averages of monthly mean 
temperatures from contiguous US stations 1960-2010. For 
clarity, three year moving averages are plotted. Inset shows 
the estimated trends over the period 1960-2010. Winter 
months exhibit the strongest warming with c = 0.043 K° /y, 
during spring and fall the warming is moderate with c = 
0.023 K°/y and c = 0.007 K°/y and in the summer months 
we find only a very small warming of c = 0.004 K°/j/. 
However for the normalized drift and therefore the record- 
statistics also the standard deviation is important, which is 
significantly smaller in the summer as well (see text). 




-2-10123 
Percentiles of the (rescaled) EOBS2 data 

Fig. 4 Q-Q plot of the linearly detrended and normalized 
daily maximum temperatures in the EOBS data (EOBS2) 
with RV's sampled from a Gaussian distribution with stan- 
dard deviation unity. The percentiles collapse on a line show- 
ing that the rescaled measurements are Gaussian to a good 
approximation. The inset shows the estimated probability 
density of the data compared to a Gaussian (dashed line). 



3.1 US monthly station data 

We obtained monthly mean temperatur es from 1217 
weath er stations of the contiguous US ( Menne et all . 
2010f) . The data cover the period 1895 and 2010, but 
but not all stations are complete. Therefore we decided 
to analyse the 50 year period 1960 to 2010 (data set 
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USM) which is the period with least missing data. This 
way we could con sider 1217 x 365 time ser ies. Based on 
the discussion in ( Wergen and Krug , 2010h we estimate 
that the number of effectively independent time series 
in this data set is much smaller than the total number 
of series. The number of independent stations is limited 
becaus e of correlations both in space and in time. Fol- 
lowing |Werj^mjmd_J^rujg (|2010h we estimate not more 
than 20-25 independent stations and around 36 inde- 
pendent calendar days, leading to a total number of 
around 700-900 independent time series. 

As one can see in Fig. [2] the monthly averages show 
a clear increase in the mean value with a drift of c = 
0.019 ± 0.005 K°/y. In contrast, the standard devia- 
tion of the monthly averages remains constant around 
a = 1.88 ± 0.03 K°, which is smaller than the standard 
deviation of the dai ly measurements of arou nd 5 K° 
for the same period ( Wergen and Krug . 2010h . Using a 
linear regression analysis we tried to detect a trend in 
the standard deviation, but a only found a insignificant 
trend of -0.0007±0.0021 K°. Assuming the constant a, 
we obtain a normalized drift c/a « 0.010 ± 0.003 y^ 1 , 
which is almo st as large as the c/a fo r the European 
daily data in |Wergen and Krugj . I2OI0I) . 

The seasonal dependence of the normalized drift c/a 
is presented in Fig. [3] During winter the US mean tem- 
perature increased most strongly, while the trend for 
summer is much smaller. However, important for the 
record statistics is the ratio c/a. The normalized trends 
are listed in Tab. [TJ Based on the theoretical results we 
expect a strong effect of the slow temperature increases 
on the record rates in winter and spring in contrast to 
summer and fall if all prerequisites of the theory are ful- 
filled. This is due to the strong difference in normalized 
drift (Tab. [TJ third column). 





c 


a 


Winter 
Spring 
Summer 
Fall 


0.043 ± 0.015 K°/y 
0.023 ± 0.008 K°/y 
0.004 ± 0.007 K°/y 
0.007 ±0.005 K°/y 


2.45 ± 0.012 K° 
1.68 ± 0.006 K° 
1.70 ± 0.006 K° 
1.56 ±0.005 K° 




c/a 




Winter 
Spring 
Summer 
Fall 


0.017 ±0.006 y~ L 
0.014 ±0.005 y- 1 
0.002 ± 0.004 y- 1 
0.004 ±0.003 y- 1 





Table 1 Averaged drift c, averaged monthly standard devi- 
ation <r and averaged normalized drift c/a for the contiguous 
US stations 1960-2010. While in winter and spring there is a 
significant and strong drift in the data, we find only little or 
no effects in summer and fall. 
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Fig. 5 Normalized record rate nP n of monthly mean tem- 
perature data from US stations. The dots represent the an- 
nual averages of nP n and the lines represent a five-year run- 
ning average. The line at one is the behaviour expected in 
the case of a stationary climate with realizations from an iid 
process. Before 1980 it is hard to distinguish between the up- 
per and lower record rate. After 1980 the upper record rate 
increases and the lower record rate decreases. 



3.2 European daily data 

The second dat a set we obta i ned a re taken from the 
E-OBS project (|Havlock et all . l2008l) . The E-OBS data 
set provides daily minimum, mean and maximum tem- 
peratures on a 0.25 degree regular grid for most of Eu- 
rope and also parts of northern Africa starting from 
1950 to the end of 2010. We analyzed the daily max- 
imum temperature data set for two time periods, one 
between 1950 and 2010 (data set EOBS1) and a sec- 
ond one between 1980 and 2010 (data set EOBS2). 
Again, it is important to mention, that despite the fact 
that wc analyzed a much larger number of time series, 
only around 300-400 series were effectively independent. 
This number is smaller in Europe due to the smaller size 
of the considered area. The time series analysis of the 
mean of the daily maximum temperature and the stan- 
dard deviation were not sign ificantly different from th e 
station data considered in ( Wergen and Kruel 2010[) . 
During the first 30 years of EOBS1 the mean tempera- 
ture shows no prominent trend. During the second pe- 
riod, between 1980 and 2010 (EOBS2), we find a clear 
warming trend of about c = 0.042 K°/y. In data set 
EOBS2, the standard deviation of the daily tempera- 
tures around the linear trend remained constant at a 
value of about 3.4 K° resulting in a normalized drift 
of around c/a w 0.012 ± 0.003 y -1 , which is slightly 
smaller than that obtaine d from European station data 
( Wergen and Krugl . I2OI0I ) . 

In Fig.[¥]we present a quantilc-quantilc (Q-Q) anal- 
ysis of the distribution of the detrended and normalized 
temperature measurements in data set EOBS2. We sub- 
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tracted a linear trend, which was obtained by a regres- 
sion analysis, from the time-series at each grid point 
and then normalized them by dividing with the stan- 
dard deviation of the measurements in the individual 
series. Then the percentiles of the measurements were 
plotted against the corresponding percentiles of a Gaus- 
sian distribution with mean zero and standard devia- 
tion unity. Apparently the measurements are Gaussian 
to good approximation and the data therefore fits our 
model of a Gaussian with a linear drift. The inset shows 
an estimation of the probability density function based 
on a simple histogram estimator with a bin width of 
0.01 K°. 



4 Occurrence of records in daily and monthly 
temperatures 





Rec. ratio - US stations 


Ratio - LDM 


Winter 


7.6 


7.1 ±5.6 


Spring 


9.4 


4.8 ±3.7 


Summer 


2.4 


1.2 ±0.7 


Fall 


1.6 


1.5 ±0.4 



4.1 US monthly station data 



Wergen and Krue ( 2010h showed that the record rate 
P n of daily US station temperatures does not display 
a significant effect of the slowly evolving mean temper- 
ature. When one considers monthly data this changes 
basically due to the reduced variance. In Fig. [5] we show 
the normalized record rate nP n for monthly means. 
Apparently, for the years after 1980, the number of 
high temperature (upper) records is above the station- 
ary case of nP n = 1 and the number of low tempera- 
ture (lower) records is decreased. Compared to the null- 
hypothesis of a stationary climate, in the years between 
2000 and 2010, we found around 1.52 times the number 
of hot temperature records and only half (0.48) the cold 
records one would expected. Therefore, the number of 
upper records was about 3.13 times the number of lower 
records. 

These results are in good agreement with the Gaus- 
sian LDM we described above. For the normalized drift 
c/er w 0.01 obtained from our data, the LDM 
predicts 1.44 ± 0.13 times more upper records than 
expected in a stationary model and only 0.50 ± 0.15 
the number of lower records. The ratio between up- 
per and lower records in the last 10 entries should be 
2.88 ± 1.12, which is also in agreement with the obser- 
vational data. Interestingly, without the very cold year 
of 2009, these numbers become more extreme. Ignoring 
the 2009 data, the ratio between the number of upper 
and lower records in the last ten years is around 4.9. 

Additionally we can consider the record rate for the 
different seasons. We computed the ratios between up- 
per and lower records for the four seasons and compared 
them to the estimates of c/a and the resulting ana- 
lytical predictions from the LDM with Gaussian RV's. 
The results of this analysis are shown in Tab. [2J As 



Table 2 Ratio between the number of upper and lower 
records after n = 50 years in the monthly data from the 
US stations along with the corresponding predictions from 
the LDM. The values for the normalized drift c/cr were taken 
from Tab. [T] 



expected mostly the winter and spring months expe- 
rience a strong effect of slow temperature increase on 
the record statistics. In spring a heat record during the 
last ten years of the observational period was almost 
ten times as likely as a cold record. In winter this fac- 
tor is almost eight. However, also in summer and fall 
the moderate warming lead to a significant effect in the 
statistics of records. Interestingly in spring and sum- 
mer the ratio between the upper and lower record rate 
is larger than predicted by the Gaussian - LDM, but 
given the large fluctuations in the data, this can very 
well be a coincidence. It is also interesting to notice, 
that both in winter and in spring less then 20 % of the 
cold records that we would have expected in the case 
of a stationary climate actually occurred. 



4.2 European daily re-analysis data 

We analyzed the record rates for the two time spans 
chosen for EOBS1 and EOBS2 (1950-2010 and 1980- 
2010) and compared them to the analytical predictions 
from our linear drift model. The results can be found in 
Fig. [5] Considering the data set EOBS1 we find that in 
the first 30 years of the observation period, there was 
no significant effect of slowly increasing temperatures 
on the record statistics. Between 1950 and 1980, both 
the rate of upper and lower records behaved roughly 
like the record rate of iid RV's. After 1985 the upper 
record rate is significantly larger than the lower record 
rate, with the lower record rate significantly decreasing. 
For the years after 2000 the ratio between upper and 
lower records exceeds two and even approaches a value 
of three at the end of the observational period. 

In the data set EOBS2, the averaged upper record 
rate was higher than the lower record rate for almost the 
entire time span between 1980 and 2010. At the end of 
the observational period there were twice as many up- 
per records as there were lower records. Here, the pre- 
dictions from our linear drift model are very accurate in 
predicting the effect of the warming on the occurrence 
of both upper and lower records. 
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Fig. 6 Upper figure: Records per station and year in the 
daily data set EOBS1 (1950-2010). Red represents the oc- 
currence of upper records and blue the occurrence of lower 
records. The crosses give the annual average and the lines a 
five-year running average. The black dashed line is the 365/n 
behavior we would have expected in the case of a stationary 
climate. Until 1980 the upper and lower record rate are hard 
to distinguish and do not vary much from the stationary be- 
havior. After 1980 the rate of upper records ceases to decrease 
and approaches a constant value of about 12 records per year, 
while the rate of lower records decreases faster than the 365/n 
line. Lower figure: The same analysis for EOBS2 (1980- 
2010). Here the upper record rate is significantly increased 
compared to the no- warming model and the lower record rate. 
Red and blue dashed lines are the analytical predictions with 
a normalized annual mean trend of c/cr = 0.012y estimated 
from the data. 
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Fig. 7 Excess number of records that occurred since the 
year 1995 per grid point per year in data set EOBS1. The 
record temperatures were recorded from the beginning of the 
time series. For this figure we only summed up records that 
occurred after the beginning of 1995. The records that con- 
tribute to this figure are the ones that exceeded the record 
that was valid at the end of 1994. 

5 Theory: Distributions of record values 

After discussing the occurrence of record events in the 
previous sections, we now turn to the record values 
themselves. There are in principle two ways to study 
the effect of a slowly changing mean value on the record 
values. One approach is to consider the probability den- 
sity function (pdf) of record values with a fixed record 
number k, which can however happen at an arbitrary 
time n. The other is to study pdf's of record events 
occurring at a fixed time step n. We do not know any 
simple way of computing the pdf's of the fcth records in 
the presence of linear drift analytically. We will discuss 
the iid case and present results of some numerical simu- 
lations. If we consider instead the probability densities 
of record values for records occurring at a fixed time n 
we can use the methods described above to compute a 
small c approximation in the framework of the LDM. 



In Fig. [7] we considered the number of records that 
occurred over a prolonged time span. We analyzed the 
upper and lower record temperatures that occurred in 
data set EOBS1 and started summing up the record 
rate beginning with the year 1995. Even though this 
figure is difficult to compare with our LDM, since we do 
not assume a linear drift over the entire time span from 
1950 to 2010, it shows how strong the effect of warming 
on the record occurrence was in the last decades. To- 
wards the end of the considered period, one finds that 
upper records occurred on average more than 2.5 times 
more often than lower records. 



5.1 Records in the n'th step 

With the general expression for the record rate P n (c) in 
the LDM we gave in section 2, we can also obtain an ex- 
pression for the probability distribution or cumulative 
distribution (cdf) 

Q n (c, x) = Prob [X n is rec. & X n < x] (9) 

of a record that occurs in a certain time step n. This is 
given by: 

Qn (c, x) = —— / dy f x (y) J] F X (y + kc) . (10) 
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The prefactor P n (c) 1 is necessary for the normaliza- 
tion. The corresponding pdf is given by: 

Tl-l 



Pn (C, x) 



1 



Pn(c) 



fx (x) J] F X (x + kc) . 



(11) 



fe=0 



For the iid case (c = 0) these expressions reduce to 
Q n (0, c) = nj x dy f x (y) F x (y) and therefore a pdf 
p n (0, x) = n jx (x) (x), which is well known as the 
pdf of the maximum of n iid RV's. Most interesting and 
important for our analysis of temperature record values 
is the mean value \i n (c) of a record that occurs at time 
n. This is given by the first moment of the pdf p n (c, x): 



(c) 



Pn(c) 



/n—1 
dy yfx (y) J| F x (y + kc 

k=Q 



(12) 



Similar to the case of the record rate P n (c) we can 
compute a series expansion for this expression in the 
regime of cn <C a. Doing this we find 



(J-n (c) ~ (l 
wherc we defined 



and 



M«(0)=n J dy yfx(y)F 



n-l 
X 



(y) 



(13) 



(14) 



(15) 



Apparently In^ is the integral that appears in our result 
for the record rate P n (c) [Eq. (0]. Furthermore n n (0) 
is the mean value of a record that occurs at time n in the 
case of iid RV's. Since we have an exact expression for 
the pdf p n (c, x), we can of course also compute higher 
moments and, for instance, the variance of a record that 
occurs in the LDM at a certain time n. For the variance 
we find: 

c 



a\ (c) « al (0) (l - 



+ ^{n n (0)(l^-I^)+In 2) ) (16) 

From now on we will f ocus on the mean v alue u n (c). 
Again, as in the work of Franke et all ( 2010h it is possi- 
ble to compute fi n (c) for instances of all three classes 
of extreme value theory and we find a systematic clas- 
sification of the behavior with respect to these classes 
(Wergen, 1201 lh . Here, we will focus on the Gaussian 
density, because it is the one we need for our compar- 
ison of the LDM to observational data. For the same 
Gaussian pdf as in section 2, we find that: 

Mc(c)«M0) + n-^ln(4). (17) 

Interestingly, in contrast to the case of the record rate 
P n (c), the effect of the drift on /i„ (c) up to first order 
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Fig. 8 Effect of the drift on the mean value fi n (c) — fi n (0) 
for Gaussian RV's. The crosses are results from numerical 
simulations. For c = and c = 0.01 we performed 10 6 runs 
each with n = 1000 RV's and averaged the differences be- 
tween the mean record values. The lines are the analytical 
results for small and large n. The inset shows a comparison 
with the small n result for n between and 100. 



in c is linear in n. In the limit of cn 3> o we expect that 
n n (c) — n n (0) is again linear in n. It is easy to see that 
for n — > oo we get fj, n (c) — fj, n (0) ps cn. In this regime, 
the drift dominates the behaviour and the mean record 
value fi n (c) is given by the linearly growing mean of 
f (x — cn) plus a sublinear contribution. 

We compared these analytical findings for the Gaus- 
sian density to numerical simulations. The results can 
be found in Fig. [5] Both in the small and in the large n 
regime the analytical predictions describe the behaviour 
of Hn (c) — u n (0) very accurately. For the chosen drift 
rate of c = 0.01 the intermediate regime of cn cx cr, 
where both descriptions given above fail seems to be 
very small. 



5.2 k'th records 

In the case of the record values of a record that occur at 
an arbitrary time n, but with a fixed record number k, 
one can easily give the full distribution of record values 
in the iid case . Here, we wi l l briefl y discuss the findings 
presented in (jArnold et al . 1998). There it was shown 
that the pdf pk (x) of a record that occurs with record 
number k can be written as follows: 

fx (x) 



Pk (x) 



(fc-1)! 



(-ln(l-F x (x))) 



k-l 



(18) 



This result is basically a consequence of the lack-of- 
mcmory property of the exponential distribution, i.e. 
the fact that a new record from an exponential distri- 
bution, independent from the time of its occurrence, 
will always be an exponential RV plus the value of the 
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Fig. 9 Normalized distributions of records with record val- 
ues k occurring at an arbitrary time n for Gaussian RV's in 
the iid case and with a constant drift of c = 0.01. Shown 
are the pdf's for k = 3, 5 and 7. The analytical results for 
the iid case (Eq. (| 181) 'l are plotted as lines, the crosses are 
the numerical results with drift. For each k and c we aver- 
aged over 10 realizations of a time series of length 10 . The 
finite length of the time series does not have a significant ef- 
fect on the distributions of records with the given values of 
k <lnl0 6 +7. 



last record ( Arnold et al , 19981) . This leads to the re- 
sults that for an exponential distribution the fcth record 
has a Gamma distribution pZ p (x) = r [k — 1, a;]. The 
general result given above can then be obtain through 
a simple mapping. 

Unfortunately, for RV's with a time-dependence like 
a linear drift, the lack-of-memory property is lost and 
we do not know how to obtain a simple expression for 
the pk (x) for the LDM. In Fig. [9] we show the normal- 
ized densities of fcth records for a few small record num- 
bers k. Apparently, already for a small drift of c = 0.01, 
there is a significant effect on the record value pdf for 
larger k. Also the width of the pdf increases as an effect 
of the drift . It seems that especially the right tails of the 
distributions become broader. In the context of temper- 
ature records these events in the tails are particularly 
interesting. 



6 Distributions of record values in European 
temperature recordings 

Based on the analytical work from the previous section 
we can now consider the values of records in observa- 
tional data. For this analysis we focused on the Euro- 
pean reanalysis data and in particular EOBS2. 

Here, the situation is a bit more complicated than in 
the case of the record rate P n (c). While the record rate 
in the LDM only depends on the normalized drift c/a, 
the values of the records depend on the standard devi- 
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Fig. 10 Upper figure: Time series of the rescaled mean 
value of temperature records that occurred in data set 
EOBS2. The full line gives the behaviour of the upper record 
value and the dotted line gives the inverse (negative) be- 
haviour of the lower record. The values of the lower records 
were multiplied with —1 to make them comparable to the up- 
per records. The rescaling is described in more detail in the 
text. The figure also contains the analytic results for no drift 
(c = 0) and the LDM with a normalized drift of c = 0.012 y~ x 
that was estimated from the observations. Lower figure: 
The same plot, but only for the two summer months of July 
and August. The analytic results for c = and the LDM case 
c = 0.012 y -1 are added. The inset shows the same analysis 
for the two winter months January and February. 



ation a itself. The standard deviations of daily temper- 
atures vary both spatially and seasonally and therefore 
it is difficult to compare the values of record breaking 
temperatures without some additional assumptions. To 
make the time series in EOBS2 more comparable we 
performed a rescaling of the data. This was done as 
follows: 

The LDM assumes that an individual series of tem- 
peratures measurements Ti,...,T n measured in n sub- 
sequent years is given by 

T k = fi + ck + a£ k , (19) 

where the are iid RV's from a Gaussian distribution 
with standard deviation one. We subtract the intercept 
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fj,o and divide by the standard deviation a to obtain the 
following time series 

1 



(T fc - fio) . 



(20) 



It is easy to see that these rescaled measurements have 
standard deviation unity around a normalized linear 
drift c = c/a. This kind of rescaling was done for all 
time series in the respective data sets individually so 
that we obtained comparable scries of rescaled mea- 
surements. This way the data is most-suitable for com- 
parison with a Gaussian LDM. If the observations were 
perfectly uncorrelated Gaussian RV's with an arbitrary 
but fixed standard deviation the record values of the 
time series after this rescaling would look exactly like 
the record values from a Gaussian LDM with standard 
deviation one. These rescaled temperatures should then 
obey the following LDM: 



a 



(21) 



It is important to notice that the ordering of the mea- 
surements and in particular the statistics of records is 
not altered by this procedure because 



T n = max [Ti, T n ] ^ T n = max 



Ti , . . . , T n 



(22) 



So if and only if T n was a record in the original se- 
ries, T n will also be a record in the rescaled series. The 
record rate P n and the record number R n are therefore 
completely invariant under this rescaling and only the 
values of the records will change according to Eq. . 



6.1 Mean values of records in a given year 

In Fig. [TUJ we analyze the mean values of a record that 
occurred in a certain year for the 30 years of observation 
in data set EOBS2. The rescaled data was first analyzed 
for each time series individually and then averaged over 
all grid points and calendar days. The upper figure gives 
the behavior of the mean value for the entire calendar 
year. The figure also shows the analytic results for the 
iid case and for the LDM with a normalized drift of c = 
0.012, which is determined from the observations. The 
behaviour of the mean upper and lower record values 
appear to be very similar. Both curves seem to have 
exactly the same shape and both lie slightly above the 
null hypothesis of a stationary climate with Gaussian 
daily temperatures. These results are not in agreement 
with the analytic results given by the LDM and do not 
show any apparent effect of slow temperature increase 
on the statistics of record values. 

The lower half of Fig. [TU] shows the same analysis 
but only for two months in summer (July and August) 
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Distribution of daily temperatures after rescaling 

Fig. 11 Estimated probability density functions of the daily 
temperature measurements in data set EOBS2 after rescal- 
ing to standard deviation unity. Estimation is based on a his- 
togram. The main figure gives the pdf's for the two summer 
months July and August, as well as the two winter months 
January and February. The dashed line is a standard nor- 
mal distribution. The pdf for the summer months is in good 
agreement with the Gaussian. The pdf for winter deviates sig- 
nificantly from a Gaussian. The inset shows the same analysis 
for the entire calendar year. Here, the rescaled distribution of 
daily temperature is again in good agreement with the Gaus- 



as well as for two months in winter (January and Febru- 
ary) in the inset. In summer, we find a much better 
agreement of the observations with a Gaussian LDM. 
The mean of the upper records increases much faster 
than the negative mean of the lower records, i.e., the 
upper records are more extreme than the lower records. 
Here, the difference between upper and lower records is 
in good agreement with the difference predicted from 
the LDM. In the two winter months (inset), the situ- 
ation is completely different. Here, the negative mean 
of the lower records increases faster than the mean of 
the upper records. It seems, that despite a significant 
positive trend in the mean values of the daily tempera- 
tures, the values of the lower records in winter are still 
more extreme than those related to the upper records. 

With these findings it is clear how to explain the 
inconsistency of the top half of Fig. [TUJwith the analyt- 
ical results for the LDM: A strong discrepancy of the 
behavior in the winter months averages out the effect 
of a slow positive increase on the record values in sum- 
mer and leads to the fact that, when averaged over the 
entire calendar year, the upper and lower record values 
behave more or less in the same way. 

To understand the anomaly in the winter months, 
we consider the seasonal variability of the pdf's of the 
daily temperature gridded values. In Fig. [TJJ we show 
the distributions of daily maximum temperatures in 
EOBS2 for two months in summer (July and August) 
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Fig. 12 Ratio between rescaled upper and negative lower 
record values in EOBS2 for July and August (red line), Jan- 
uary and February (blue line) and the entire calendar year 
(black line). The black crosses give the behavior of this ra- 
tio for a Gaussian LDM with a drift of c = 0.012 y- 1 . The 
red and blue crosses give the development of the ratio for the 
summer and winter month when sampled from the respec- 
tive observed distributions of daily temperatures in summer 
and winter. The thin black dashed line gives the behavior 
one would expect in the case of an iid Gaussian stationary 
climate. 



and two months in winter (January and February). The 
distributions were obtained after rescaling of the daily 
temperatures, as described above, so that they fitted 
a LDM with standard deviation one. The inset in Fig. 
im shows the analysis for the entire calendar year. Ap- 
parently the distribution for the winter months in data 
set EOBS2 is not Gaussian and has a much broader left 
tail. We believe that this asymmetry of the distribution 
is responsible for the anomalous behaviour of the mean 
record values in winter. 

In conclusion we find that while the ratio between 
rescaled upper and negative lower record values in sum- 
mer is larger than one, it is less than one in winter, be- 
cause of an asymmetric distribution of daily tempera- 
tures. Nevertheless, it might still be possible to describe 
the record values of the observations with a LDM, but 
we have shown that the underlying LDM for the winter 
months can not be based on a a symmetric Gaussian 
distribution. 

To further explore this asymmetry we performed 
numerical simulations based on the empirical pdf's of 
daily temperatures in EOBS2 for summer, winter and 
the entire year. In Fig. [12] we show the ratio between 
upper and lower record values in the data and compare 
them to the results from simulations with the distribu- 
tion obtained from the data as well as the predictions 
from the Gaussian LDM. 

For the entire calendar year we find that the ratio 
remains close to one for the entire observation period, 
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Fig. 13 Upper figure: Estimated pdf's of record values 
in the rescaled temperature data from EOBS2 for the entire 
calendar year. The red crosses give the behavior of the up- 
per records and the blue crosses the behavior of the negative 
cold records. The red and blue lines give the results from 
numerical simulations with a Gaussian LDM and a drift of 
c = 0.012. The black line is the distribution one expects in 
a stationary climate (c = 0) and Gaussian daily tempera- 
ture measurements. All distributions are not normalized, the 
fact that the area under the curve from the upper records 
is much larger than the area under the corresponding curve 
from the lower record is a consequence of there being much 
more upper records than lower records. Note that, for these 
rescaled temperatures, the origin corresponds to the mean of 
daily temperatures in the initial year. Lower figure: The 
same analysis for the two considered months in winter (Jan- 
uary and February) as well as, in the inset, the two months in 
summer (July and August). Numerical results are not given in 
this figure. For illustrative purposes we also plotted a Stan- 
dard Normal distribution (with standard deviation er = 1), 
which in the framework of a Gaussian LDM, represents the 
rescaled distribution of daily temperatures in the first year. 
The area under the Gaussian equals the average of the areas 
under the shown probability densities of the upper and lower 
records. 
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in contrast to a Gaussian LDM. However, if we con- 
sider the summer and winter months separately and 
compare the ratios obtained from the data with the ra- 
tios one obtains from a LDM with RV's sampled from 
the respective distributions of daily summer and winter 
temperatures, we find a good agreement with these non- 
Gaussian LDM's. In summer the ratio is strongly pos- 
itive with upper records being generally further away 
from the increasing mean value than lower records. In 
winter, due to the skewness of the distribution, the sit- 
uation is reversed and lower record values are shifting 
to be more extreme. 

6.2 Distribution of records in a certain year 

We also analyzed the full empirical pdf's of daily tem- 
perature records in the rescaled temperature data. Fig- 
ure [T3] shows the pdf's of temperature records for the 
entire calendar year (upper figure) and, again, the two 
considered months in winter and summer (lower figure) 
that occurred in the last five years of the observation 
period of EOBS2. We did not normalize the distribu- 
tions for illustrative purposes. In the upper figure we 
compared the observational distributions with numeri- 
cal results from the Gaussian LDM with a normalized 
drift of c = 0.012 y~\ 

We find that while the rescaled upper record values 
are in good agreement with the Gaussian LDM, the 
distribution of the lower record value is significantly 
broader than expected from the simulations. The pdf's 
of the upper and lower record values seem to have more 
or less the same mean value, which is in good agreement 
with our findings in the previous section and Fig. 1101 
The pdf's in this figure are not normalized, the total 
area under the curves corresponds to the number of 
records that occurred in the last five years. As a re- 
sult the upper figure shows that there were many more 
upper records than lower records in the data, but the 
shapes of the pdf's of upper and lower records look very 
similar. 

The pdf's in the lower figure are also in good agree- 
ment with the mean values in Fig. [TO] The pdf's for the 
winter months show that in winter the lower records 
were more extreme with a mean value further away from 
the mean of the daily maximum temperatures. 

Also the width of the pdf of the lower records is 
larger than in the case of the upper records, so the val- 
ues of lower records have a large inherent uncertainty. 
While there are almost no upper records more than 3cr 
away from the average behavior, there is a large num- 
ber of lower records that exceeded this barrier, some of 
them were even beyond 4er, which, given the large stan- 
dard deviation of daily temperatures in winter (up to 
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Fig. 14 Mean values of record-breaking temperatures with 
a fixed record number k in EOBS2 data using the same rescal- 
ing as for the record values with fixed time of occurrence n. 
The red lines show the time series of the upper record val- 
ues depending on k, the blue lines the corresponding series of 
the lower records. The black dashed line shows the growth of 
the mean of records with record number k for Gaussian RV's 
without drift. The insets show the same analysis for January 
and February (left inset) and July and August (right inset). 

8 K°). is a huge fluctuation. The inset in the lower fig- 
ure gives the same plot for the summer. Here, in agree- 
ment with a Gaussian LDM, upper records are more 
extreme than lower records. 



6.3 Records with a given record number 

We also analyzed the statistics of record temperatures 
in the data set EOBS2 with a given record number k. 
For this purpose we performed the same rescaling as 
described above in the context of records with a fixed 
time of occurrence n. The results of the analysis for two 
months in winter, in summer and over the entire cal- 
endar year in data set EOBS2 can be found in Fig. [TU 
The figure shows the estimated mean value of records 
in the time series of the rescaled entries Tj (Eq. ((20]) ) 
plotted against the record number k. 

This analysis of the mean record value with fixed 
record number is in good agreement to our findings in 
the above analysis of the mean record values with fixed 
occurrence time n. The analysis for the entire calen- 
dar year shows that the behaviour of the mean is again 
comparable for the upper and lower records. If we con- 
sider the two winter months January and February (left 
inset), we find again that cold records are further away 
from the mean value than the upper records and are 
therefore more extreme. In July and August (right in- 
set) it is exactly the other way around. Here the LDM 
based on symmetric, Gaussian RV's works very well and 
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Fig. 15 Estimated pdf's of record values with a given record 
number k of the rescaled daily maximum temperatures from 
EOBS2. The red lines give the pdf's of upper records with 
record numbers k = 3, 5 and 7. The blue lines give the pdf's 
of the negative lower records with the same record numbers 
k. Note that, as in Fig. 1131 the density functions are not 
normalized. 



mean record values show the expected asymmetry with 
more extreme upper records due to the positive trend. 

In Fig. [IS] we also show the estimated pdf's of record 
values for some selected record numbers k and the en- 
tire calendar year. Again, the density functions not nor- 
malized, so the area under the curves corresponds to 
the total number of records with a given record num- 
ber. Apparently, even though in all cases the number 
of upper records was higher than the number of lower 
records, the shapes of the pdf's look very similar. 



6.4 Record values in the far north 

A reason for the asymmetry of the daily temperature 
distribution in winter may be that some parts of Eu- 
rope, especially in the north, are covered by snow dur- 
ing the winter months. Snow cover decouples the atmo- 
sphere from the soil through its isolating effects. Fur- 
ther a snow surface is a very efficient black body radi- 
ator. Therefore snow covered surfaces tend to produce 
very low near surface atmospheric tempe ratures espe- 
cially under conditions of small insolation ([Geiger et all . 



ciaiiy 
19951) . 



In Fig.Ql)]we show the pdf's of records that occurred 
in the last five years of the EOBS2 data only for grid 
points north of 60°^. Apparently, here the effect of the 
asymmetry in the daily temperature pdf's on the record 
values is even stronger. The estimated density functions 
of the lower records is centered more than one standard 
deviation further away from the mean than the density 
functions of the upper records. The inset shows the pdf 
of the daily temperatures for this region, which is much 
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Measurements winter, north of 60N 

Fig. 16 Estimated pdf of record values in the rescaled tem- 
perature data from EOBS2 for the months of January and 
February in the regions north of 60° ./V. The red crosses give 
the behaviour of the upper records and the blue crosses the 
behaviour of the lower records. The inset shows the estimated 
pdf of rescaled daily temperatures in these two months and 
in the region north of 60° ./V. Note that, as in Fig. 1131 the 
density functions are not normalized. 



more heavily skewed than the pdf for the entire data 
set of EOBS2 (compare to Fig. 111"]) . 



7 Discussion &: Conclusion 

In this article we analyzed both the probability of oc- 
currence and the probability density function of record 
breaking temperatures values in Europe and the United 
States. After presenting some analytical findings on the 
Linear Drift Model (LDM) in section 2, we analyzed 
the occurrence of records in daily and monthly data 
in sections 3 and 4. In a greement with earlier work by 
Wer gen and Krug (I20T0I) we found a significant effect 
of the observed slow positive trend of temperature on 
the number of upper and lower records. This effect can 
be described by the LDM up to some accuracy. The 
effect of increasing temperatures on the monthly mean 
temperatures is clearly stronger than the effect on the 
daily measurements because of the smaller standard de- 
viation of those averages. During certain seasons in the 
United States we found up to nine times more upper 
records than lower ones at the end of the 51 years of 
observation 1960-2010. Th is also explains the findings 
of Rahmstorf and Coumou (|201lh , who studied monthly 
and annual mean temperatures and found the strongest 
effect of warming for the annual global-mean tempera- 
ture. The global mean has a very small standard devia- 
tion of a = 0.088 K° which implies that already a very 
small drift can measurably increase the upper record 
rate. 



Record occurrence and record values in daily and monthly temperatures 



15 



In section 5 we presented new results on the mean 
value of a record that occurs at a certain time in the 
LDM. We were able to obtain analytical results of the 
effect of a linear drift on the mean value of a record in 
case of Gaussian RV's in the important regime of cn <C 
a. In section 6 we compared these results to the grid- 
ded data set EOBS2. When analyzing the entire calen- 
dar year, a Gaussian LDM fails to describe the effect of 
slowly increasing temperatures on the mean record val- 
ues. The reason for this failure is a pronounced asymme- 
try of the daily temperature probability density func- 
tion in winter. In contrast the record values during the 
summer months can be well described by a Gaussian 
LDM. 

When we use a non-Gaussian LDM with RV's sam- 
pled from the daily temperature probability density of 
the considered winter months, this model is also capa- 
ble of describing temperature record values in winter. 
As a general result the lower records in winter are sig- 
nificantly more extreme than upper records because of 
this asymmetry. This leads to the unintuitive results 
that lower records of near surface temperature in the 
case of slowly increasing temperatures occur with a re- 
duced probability but once they occur they are more 
extreme than their upper counterpart due to the pro- 
nounced asymmetry of the daily temperature values in 
winter. When we consider grid points in the northern 
parts of Europe this effect is even stronger. Here lower 
records are on average more than one standard devi- 
ation further away from the mean than upper records 
and the estimated density of lower records is also clearly 
broader than that of the upper records. 

It might be interesting to further explore the effect 
of specific weather conditions especially in northern Eu- 
rope, but also in other regions, on the statistics of record 
values. In particular it remains an open question, how 
strongly the occurrence of very extreme lower records 
in winter correlates with snow coverage and other me- 
teorological events, such as winter blocking highs. It is 
however clear that the fact, that we still encounter quite 
extreme cold streaks in winter, particularly in northern 
regions, is not in contradiction with global warming and 
only a consequence of the very skew distribution of daily 
temperatures in winter. 
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